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(54) Abstract Title 

Measuring the delay of audio stream packets across a network using marker packets 

(57) A system is provided for measuring delays across equipment in a packet network. A test audio signal is 
generated by a signal generator 102. A multimedia terminal 104 receives and digitally encodes the test audio 
signal forming an audio stream packet for transmission onto the network 108. Multimedia terminal 106 
receives and decodes the audio stream packet and regenerates test audio signal which is then looped back to 
the signal generator 102. At various stages (e.g. test signal generation, audio stream packet transmission), 
marker packets-are generated-by_the_packet.generator ^^0 in respqnsejo trigger signals from the signal 
generator 102. Marker packets and audio stream packets are time-stamped^ and logged for ahalysisT Time 
differences between marker and audio stream packets determine delays at various stages. Measurement 
accuracy can be improved by locating the position of the encoded test signal in the audio stream packets. 
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2355901 

MARKER PACKET SYSTEM 
AND METHOD FOR MEASURING AUDIO NETWORK DELAYS 

FIELD OF THE INVENTION 
5 The present invention relates generally to the field of electronic multimedia 

communications and more specifically to a system and method for measuring audio 
signal and packet delays across a network. 

BACKGROUND OF THE INVENTION 

10 With the increased deployment of local area and wide area networks within 

corporations and organizations, new technologies have emerged to take advantage of 
such network deployments. In traditional networks, voice communications are 
facilitated by direct, circuit switched connections; however, such networks are resource 
intensive and costly. Therefore, new technology has arisen that provides for the 

1 5 transmission of voice communications over packet transport networks that were 

originally designed primarily for data transmission. The transmission of voice over 
packet transport networks-introduces new challenges into the design of not only the 
packet transport network but also the multimedia equipment connected to such 
networks to facilitate the voice communications. Voice communications are particularly 

20 sensitive to time delays, therefore it is extremely important to have efficient multimedia 
equipment for receiving and generating audio signals, placing them on the network, as 
well as efficient packet transport network communications equipment and an efficient 
network topology. There is a need in the art for a system and method to measure the 
delays in multimedia equipment and across packet transport networks to aid in the 

25 design, configuration and development of efficient multimedia temiinal equipment, as 
well as the design and development of efficient network equipment and typologies. In 
addition, there is a need in the art for a system and method for effective isolation and 
the removal of network bottlenecks in voice communications. 

30 SUMMARY OF THE INVENTION 

. The present invention provides-a sj^stem and method to measure the delay s-in 
multimedia equipment and across packet transport netv^'orks to aid in. the design; 



S'^SDOCID <GB 23£5&0lA_l_> 



0 



*e occ>„e„ce of i„p« ^ ou^u, audio .ig„a>s. TOe syste™ and .eftod ,oca.esl7 
cap^es. „.os.„..a.,...„e„coded..oanaud.os.^ 

n ,„ork. TT,e p^sem ,„ve„,,„„ can measure fte delay benveen an analog audio 
0 s,„a^ and oc„ of audio 3..a. pae.e. on ne Jo 1 

Detween the occurrence of an audio sh-Mm r^=^^ . 

^ P^*"^^^ o° a network and the subseauenf 

> signal creation at the start to audio .,an»i • ay xrom audio 

tart audio signa] generation by a receiver at the destination 
1 he present invention consists of a preferreH »r,H i. 
u P ^temate embodiments that 

may be used to measure packetized voice delav. «t H «- • 
tran.mitc • -r ^ different points in a network that 

transmits information and data in packets. 

According to one aspect of the present invention there is provided a system fo 

~ng delays across equipment in a packet network comprising: ca) i^^^^^^ 
„forgeneratingtestaudiosign^ 

terminals coupled lo the networli:ar.H(k..;„ . "eiwonc 

*"'«'^8™«'«or for receiving the test aurfi„ 
signals, coding and decoding audio stream Dact„= , ■ ■ ^""i'" 
stream packets on the „e Jk and I * 

coupled to the network ITfter 7 ^ '""'^ ' ""^^^ ~ ' 

nen! I. ■ ' «°»''^>i"e marker packets on the 

network „ response to the trigger sig„.s. (d) a network monitor coupl d to Ure l^ork 

^r^aptu„ngthem.kerpacke.sand.e audio stream packets and Irdin^^ 
^create capnued data, and (e) a packet ^ J^I 

analy^ng the captured data ,o generate and display measurements of delays 

According to another aspect of the present invention, there is provided a system 
or measuang d.,.. across c,.ip,e„, in a packet network co.pHsirg.- (a, a ,2 
.e„er.or.,or generating .s, ,ud,o si,aM,, oneor mor. ...imcdi!,^^^^^^^^ 



coupled to the network and the pulse generator for receiving the test audio signals, 
coding and decoding audio stream packets, transmitting and receiving the audio stream 
packets on the netv^ork and regenerating audio signals; (c) a trigger circuit coupled to 
the pulse generator and the multimedia terminals for receiving regenerated audio signals 
and test audio signals and generating trigger signals; (d) a packet generator coupled to 
the network and the trigger circuit for generating marker packets on the network in 
response to the trigger signals; (e) a network monitor coupled to the network for 
capturing the marker packets and the audio stream packets and recording time stamps to 
create captured data; and (f) a packet analyzer coupled to the network monitor for 
analyzing the captured data to generate and display measurements of delays. 

According to a further aspect of the present invention there is provided a method 
for measuring delays across equipment in a packet network comprising the steps of: (a) 
generating fu-st marker packet on the network contemporaneously with the generation of 
a test audio signal to a network terminal; (b) generating an audio stream packet by the 
network terminal on the network containing an encoded audio signal corresponding to 
the test audio signal; (c) capturing the first marker packet and the audio stream packet 
with associated time stamps corresponding to moment of capture; and (d) subtracting 
the associated time stamps corresponding to the first marker packet and the audio 
stream packet to determine raw delay across the network terminal. 

In addition, according to a further aspect of the present invention, there is 
provided a method for measuring delays across equipment in a network comprising the 
steps of: (a) generating first marker packet on the network contemporaneously with the 
generation of a test audio signal to a first network terminal; (b) generating an audio 
stream packet by fu-st network terminal on the network containing an encoded audio 
signal corresponding to the test audio signal; (c) regenerating the test audio signal on a 
second network terminal in response to receipt of the audio stream packet on the second 
network terminal; (d) generating a second marker packet on network 
contemporaneously with the regenerating of test audio signal on the second network 
terminal; (e) capturing the first marker packet and the second marker packet wdth 
associated time stamps corresponding to moment of capture; and (f) subtracting the 
asspeiated'time stamps corresponding to the first marker packet and the second marker 
packet -to determine end to end delay across the network. 
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In addition, according to a further aspect of the present invention there is 
provided a method for measuring delays across equipment in a network comprising the 
steps of: (a) generating first marker packet on said network contemporaneously with the 
generation of a test audio signal to a first network tenninal; (b) generating an audio 
stream packet by first network tem^inal on said network containing an encoded audio 
signal corresponding to said test audio signal; (c) regenerating said test audio signal on 
a second network temiinal in response to receipt of said audio stream packet on said 
second network terminal; (d) generating a second marker packet on network 
contemporaneously with said regenerating of test audio signal; (e) capturing said first 
marker packet, said second marker packet and said audio stream packets with associated 
time stamps corresponding to moment of capture; (f) subtracting said associated time 
stamps corresponding to said first marker packet and said second marker packet to 
detennme end to end delay across said network; and (g) subtracting the associated time 
stamps corresponding to the second marker packet and the audio stream packet to 
1 5 determme raw delay across the second network terminal. 

BRIEF DESCRIPTION OF THE DRAWINGS 

A detailed description of the prefeired embodiments is provided hereinbel 
with reference to the following drawings in which: 

Figure 1 is a diagrammatic representation of an enviromnem illustrating 
components of the present invention; 

Figure 2 is a diagrammatic representation of system delays that may be 
compensated for in the environment of Figure 1 ; 

Figure 3 is a diagrammatic representation of packets on the network in the 
environmem of Figure 1, with delay (Figure 3 A) and without delay (Figure 3B); 

Figure 4 is a diagrammatic representation of a first alternate embodiment of 
components of the present invention; 

Figure 4A is a diagrammatic representation of a second alternate embodiment of 
components of the present invention; and 

■ Figure 5 is a diagrammatic representation of a third alternate embodimem of 
components ofthe present invention.- ■ •• ■ - .■ 
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DESCRIPTION OF THE PREFERRED EMBODIMENT 

Turning to Figxire 1, a typical environment 1 00 of the present invention is 
shown. The measurement system and method of the present invention uses a signal 
generator 1 02 connected to one or more network devices for generating and receiving 
signals from the network devices. In a preferred embodiment, the network devices are 
network terminals that are sound enabled such that they can generate and receive audio 
signals. Preferably, they are multimedia terminals 104 and 106, which may be personal 
computers as are known in the art with graphical user interfaces, and may run operating 
systems such as Windows 95 or other well known operating systems. Multimedia 
terminals 104 and 106 may employ commercially available sound cards or devices to 
facilitate the audio function. The multimedia terminals 104 and 106 are cormected to a 
packetized network 108 with commercially available network interface cards or 
network interfaces and in a preferred embodiment, multimedia terminals 1 04 and 106 
utilize commercially available software and hardware employing the public standard 
ITU-T Reconrunendation G.71 1 (PCM) coding for digital representation of audio 
signals transmitted or received on network 108. Ahematively, network terminals may 
be dedicated multimedia terminal devices with operating functions coded in hardware 
or firmware. Network 108 may be any type of commercially available packetized 
network, and in the preferred embodiment, network 108 uses Ethernet protocols. 
Alternatively, network 108 may incorporate multiple network segments, bridges, 
routers, wide area network (WAN) and Internet cormections. The invention may be 
adapted in a manner obvious to one skilled in the art to operate on networks using 
protocols other than Ethernet, 

The inventive system and method also uses a component for marker packet 
generation connected to both signal generator 102 and network 108. It also employs a 
component for network monitoring cormected to network 108. The packet generation 
component and the network monitoring component may be combined into a single 
device shown as packet generator and monitor 11 0 in Figure 1 . 

Furthermore, the inventive system and method uses a packet analyzer 1 12, 
which is preferably coupled to the network monitor, for performing the measurement 
and analysis functidns..The aha]y^r .l4'2 have a display foroutput of results 

and memory for storage of measurement results. The analyzer 112 may be embodied in 



softwa,. n^ng on a ^-pical personal computer com,ec,ed ,o the ne^vorking 
momtoring componem by a senal or por, or otter appropriate means 

The basic operational principles of tte preferred embodiment of a,e present 
.nventron are described below. A .est audio signal is generated by signal gene^tor ,02 

r.rs.Tur'^r"'™^"'"'-^"'''"^'''^'^™'^ 

U,e tes audro stgna, to cr^te a cotresponding audio stream packet, and transtnits the 
aud,o stream packet onto netwotic ,08 for ..caption by nrultimedia tenninal 1 06 T.e 
audto stream packet is recognizable by the networking monitoring component and when 

"~-=n^-'0.receivesi..om.bene.work.itdecodes^ 
0 back to aud o and rcEenerates th. ■ , '"^■■gnai 

sen,, H ■ *''"«""'^°'"P"''n'"egene,a.ed audio signal is 

.ent to and recerved by signal genet^tor ,02. A. various stage, marker packets are 
generated by .be packet generator component triggered by *e stgna, generator 1 02 to 

*e delay a. vanous stages. operation of fte system can also be reversed, so that 
he stgna, generator ,02 generates an audio s.gna, for multimedia tet^ina, 106 that is 
.venntally regenerated in a similar manner, on multimedia terminal 104. Optionally 
audto stream packets may aavel in both dit^ctions a. the same time 

The signal generator ,02 is connected ,o the inputs and outputs of U,e 
mu^ttmediatemtin^s ,04 and ,06. preferably audio input U 4 and audio output „ 8 of 

tlTtnT'""' "^-"^-"""'"P- U6 and audio output ,20 of multimedia 
termtna, ,06 for tte generation and reception^f audio signals. n,e signal generator 102 

■n the ar,. I,e stgnal generator , 02 is also connected the packet generation componen. 

ofpacke.genera.orandmo„i.or„0usingst.dardcab,es.Signa,gc„era«,r,02 ' 

'm^; ^isnals Which are input into audio inputs 

"Hand nsofmulumediatenninals ,04 and .06 respectively, and ^so receives the 

regenerated audio signals from fte outputs 1 1 8 and 1 20 of ,h. 

,.™: 1 . '20 of the respective muWmedia 

terminals 104 and 106. 

test audio" T"'' '~ ' ~ ^ 

I rr ^ """"'^ ^" Of a 

-voltage and has pulse shape. which.is conve„ie„..f.r detection. However, the tnvention ' 
.s no. resected to the pulse shaped signal only. OUter signal shapcs are possible Z 



test audio signal is used to initiate a delay measurement, and it is generated repeatedly 
at an adjustable rate by an adjustment in signal generator 102. Preferably, the period of 
repetition is chosen long enough to accommodate the largest possible end-lo-end delays 
that are expected to occur, but short enough to make measurement convenient. No 
overlap of test audio signals in any particular direction should be allowed, such that the 
test audio signal originated by the signal generator 1 02 and sent through multimedia 
terminal 1 04 must be propagated through the network 1 08 and must appear on the 
multimedia terminal 106 audio output 120 before the next test audio signal is generated 
by signal generator 102. The voltage level should be adjusted to nominal audio levels to 
prevent damage to the equipment. In the preferred embodiment, the pulse width of the 
test audio signal is chosen to be 1ms, but larger pulse widths can also be used. Shorter 
pulses than 1 ms are not preferred as they can be hard to detect. The test audio signals 
generated by the signal generator 1 02, as well as the regenerated audio signals from the 
far end (i.e. multimedia terminal 106), are used to initiate the generation of marker 
packets onto the network 108 by the packet generator component of packet generator 
and monitor 110, 

The packet generation cornponent of packet generator and monitor 1 10 is 
adapted and used to generate marker packets on network 1 08 that are unique and easily 
recognizable for capture from the network 108. The nature of the marker packets and 
the audio stream packets is that they can be recognized, captured and analyzed. Marker 
packets can be created, identified and recognized by choosing a specific size for the 
packet, or designating a specific pattern of characters within the packet generator 
component of packet generator and monitor 110. TTie network monitor component of 
packet generator and monitor 1 1 0 is adapted and configured in a manner to capture the 
network traffic containing the marker packets and the audio stream packets. With each 
packet captured, the network monitor component records a time stamp associated vnth 
the packet. The network monitor component of the packet generator and monitor 1 1 0 
captures the network traffic, and may produce a hexadecimal trace file that may be 
analyzed off-line by the analysis tool 1 12. Altematively, the network monitor 
component may funnel the captured- traffic and time stamps directly to the analysis tool 
H2. . . • ' ' - . 

When the signal generator 102 generates a test audio signal- thai is input on- 
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multimedia terminal 1 04, a trigger signal, which may be the same test audio signal, is 
simultaneously sent to the packet generator componeiit of packet generator and momtor 



110. 



0 



Marker packets are thus contemporaneously generated and introduced onto the 
network 1 08 by the packet generator component of the packet generator and network 
monitor 11 0 on the rising edge of source test audio signal created by signal generator 



102 



Furthermore, signal generator 102 also acts as a relay of output signals 
generated by multimedia temiinals 106 and 104. When an output signal is regenerated 
by either of multimedia temiinals 104 or 106, a similar trigger signal as described above 
is created by signal generator 102 on receipt of the rising edge of the regenerated audio 
output signal and is sent to the packet generator component of the packet generator and 
momtor 110. Since some multimedia systems can cause inversion of audio signals the 
signal generator 1 02 may be programmed to invert the regenerated audio output signal 
Trigger signals sent from signal generator 102 to packet generator and monitor 1 10 may 
be sent on different signal lines depending upon whether they are input or output signals 
and whether they are from multimedia temunal 104 or 106 to allow the packet generator 
to distinguish the signals and generate an identifiable corresponding marker packet. 
Alternatively, the trigger signals could be distinguished using some other means, such 
as voltage, etc. 

In the preferred embodiment, audio delays can be measured between several 
reference points in the environment 100. For notational puiposes. the reference points 
are referred to as follows: All represents the audio input 1 14 of multimedia temunal 
104. AOI represents the audio output 1 18 of multimedia terminal 104. Ell and EOl 
represents the network input and output respectively of multimedia temiinal 104 In a 
similar manner: AI2, A02, EI2 and E02 represent the coxresponding inputs and outputs 
on multimedia terminal 106. Using this notation, AIl-EOl is the audio delay from the 
analog audio input of the multimedia temunal 104 to the network output of multimedia 
terminal 1 04. Analogously, the other audio delays that can be measured are EI 1 -AO 1 . 
E01-A02, AI2-EI.I, and the end-to-end delays AI1-A02, AI2-AO]. 

•• I" ^othfer embodiment of the pf^^^^^^ 
dMmons,..e. .tParismh^and receive direction, tKroughthc multimedia terminals l-04,and 



9 



1 06 at the same time, two different marker packets are injected into the network, one 
type of marker packet for each direction. As mentioned above^ marker packets can be 
made unique by choosing a specific size for the network packet or by a specific pattern 
of characters and by configuring the network monitoring tool and analysis tool to 
5 recognize and analyze such packets accordingly. 

Returning to Figure 1, as mentioned above, signal generator 102 sends a trigger 
signal imique to the event to the packet generator component of packet generator and 
monitor 1 10 to generate a marker packet. The marker packet can be used to identify a 
particular event such as the origination of a test audio signal or receipt of a regenerated 
10 audio signal. The network monitoring features of packet generator and monitor 1 10 is 
adapted and configured to recognize packets, and ensures that all network packets of 
interest and particularly marker packets and audio stream packets are captured and time- 
stamped. 

After the desired number of measurements are taken and captured by the 
1 5 network monitor component, capturing is stopped, and optionally a hexadecimal trace 
file is produced. 

The time stamp resolution, the delays of propagating signals across the packet 
generation component, and the delay of sending the marker packets will affect the 
measurement accuracy, 

20 Specifically, the signal generator 1 02 and the packet generator and network 

monitor 110 introduce additional delays to signals traveling through these components. 
The signal generator 1 02 transforms an analog signal to a digital trigger signal with a 
certain delay, and monitor 1 1 0 sends a packet to the network with a certain delay after a 
trigger signal is received. These delays may not always be constant but the accuracy of 

25 delay measurements can be improved by taking into account these delays. Figure 2 
illustrates the delays that may be compensated. 

The packet generation delay affects only the packet delay. This delay is added to 
the non-compensated raw packet delay to get a more accurate packet delay. For the end- 
to-end delay calculations, compensation is not necessary because both the initial marker 

30 packet and the final one are delayed by the same amount. The end-to-end delay is 

additi'onally. affected by the' signal generator delay, which is subixaexed.frorri the non- 
compensated end-to-ehd delay. The compensation is carried out by the analysis tool 
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1 12. The packet generator delay is largely dependent on the computing platform used 
for that application. On the other hand, the signal generator delay is more constant and 
it depends on the electronic circuits for audio signal detection and signal shaping. 

The analysis tool 1 12 is configured and adapted to recognize marker packets and 
audio stream packets and calculates the delay values as well as provide statistical 
analysis of the results. It may optionally provide off-line processing of the hexadecimal 
trace file captured by the network monitor. In a preferred embodiment, analysis tool 1 12 
may be embodied in software on a typical personal computer connected by serial port, 
or other well known means, to the network monitor. Optionally, the hexadecimal trace 
file may be stored and later conununicated by other means, such as floppy disk 
memory. The analysis tool 1 12 performs its subtraction measurement by using the 
difference in time between the time stamp on the marker packets and the time stamp on 
the corresponding audio stream packets. In the operation of the preferred embodiment, 
network packets of two types are generated to measure the delay AIl-EOl: (a) a marker 
1 5 packet on the generation of the test audio signal; and (b) a corresponding audio stream 
packet of the test audio signal. The time differential of these two packets can be used to 
calculate the delay through the multimedia terminal 104. When the audio stream packet 
is received by multimedia terminal 106, the regenerated test signal is produced. A third 
network packet, i.e. a marker packet, is generated when the audio is regenerated by 
20 multimedia terminal 1 06 and can be used to determine the network delay and 

multimedia terminal 106 delay. Using the timing information from the audio stream 
packet and the marker packet from the regenerated output audio signal from multimedia 
terminal 106, the delay E0I-AO2 can be calculated. Furthermore, using the two marker 
packets, the delay AI1-A02 can also be calculated. The procedure can also be operated ' 
25 in reverse with the signal generator 1 02 using a test audio signal to audio port 1 20 of 
multimedia terminal 106. In a manner obviously similar to that described above, the 
delays A12-EII, Ell to AOl can be calculated. Similar packets to the procedure above 
are produced in reverse. 

Audio stream packets can be identified on the network in the following manner. 
3 0 Each packet of the captured network traffic is examined, whether it is a marker packet 
or a packet eohtaihing the encoded audio source signal. Deperidiiig on which encoding 
law was used, in th-e system, -A-law. or u-i^w (btjth 6f-vi:hi<?h-afe-well-l0i)!wfe), a15d. ■ 



3NSDOC1D: <GB 2355901A l_> 



11 



depending on the input voltage level, the input pulse will be coded by a certain constant 
value for the duration of the pulse. If the G.71 1 standard is employed, then there are 
8000 samples per second, and the encoding of the samples consists of 8 binary digits 
per sample, and an input pulse of 1ms duration will be encoded by 8 bytes in the 
5 payload of the audio stream packet. Longer input pulses would be coded proportionally 
by more bytes. Because variations of the input signal level can occur, a searching 
algorithm can be employed in the analysis tool which allows for a variation of byte 
values between two extreme values, and looks for a match of a predefined number of 
bytes that are consecutive in the packet payload, at certain offset from the beginning of 

10 the packet frame. The offset in the Analysis Tool 112 can be set to a selected value. 

Using as an example of the usage of RTP protocol for sequencing the audio packets, as 
the H.323 standard suggests, the offset for searching for the encoded pulse in the packet 
would be 54 bytes. This offset includes the MAC, IP, UDP and the RTP header. 

After the successful search for the encoded audio pulse, the raw audio signal to 

15 packet delay is calculated by subtracting the time stamp of the last marker packet from 
the time stamp of the audio stream packet that contains the encoded audio pulse of the 
test audio signal; this.is the raw delay AIl-EOl. The raw voice packet to regenerated 
audio delay (i.e., the raw delay E01-A02) is calculated by subtracting the time stamp 
of the packet that contains the encoded pulse from the marker packet time stamp that 

20 follows the former packet. 

While the time stamps on the marker packets and the audio stream packets can 
be used to calculate the raw delay with a certain level of accuracy, further improved 
measurements can be performed by locating the position of the digitally encoded test 
signal in the audio stream packets as described in further detail with respect to Figure 2. 

25 Consider an application wherein voice packet streaming occurs every 20 ms, i.e. 

every 20 ms an Ethernet packet is sent on the network. The G.71 1 standard specifies 
8000 samples per second, and sample encoding by 8 bits. The 20 ms audio information 
is encoded by 160 samples, i.e. 160 bytes (0.020*8000). The RTP packet payload is 
therefore 1 60 bytes per packet. The total length of an Ethernet voice streaming packet is 

30 thus 218 bytes: 54 bytes header (MAC, IP, UDP and RTP header), 160 bytes payload 
bytes and 4 bytes FCS (Ethernet Frame Check Sequence). The- Ethernet packet starts- 
wdth a preamble followed by the MAC header and all other (IP, UDP, RTP) headers, 
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followed by RTP p.y,oad data, and finally 4e FCS. The neWk ™om.or 110 takes 
«me ..amp mfortnalion afte. fte packe, as a whole (after reeeiving Uie te, by,e of FCS) 
has been received, n-e du«,ion of the packet on a 1 0 Mbit/s network is 0. 1 74ms (i e 
2,8.8.0.0000001). nre audio input signal (Imsin duration) is encoded by 8 bytes The 
packet abstraction of Figure 3A represents fl. 20 ms audio infonnation. which ends 
when a voice packet (i.e. an Etheme, packet that contains the coded audio test (voice) 
signals) is generated (the time st»,p is ^enby the network momtor U). and it starts 
20 n,s earlier. ITte RPT payload bytes at the begi^ung of the payload (left-hand side) of 
Ftgtoe 3A represent audio samples that occurred earlier in the past; the las. byte of the 

payload (right-hand side) of Figute 3A represent, the most recent audio sample 
Because of processing overhead, the multimedia te™„a, 1 04 introduces a delay which 
results .„ shifting the audio input samples (and all the samples) toward .he end of me 
RTP payload (U,e audio inpu. does not appe^ aligned with the encoded audio input in 
.he payload). Shifting the audio input samples (bytes) in the payload causes delays in 
5 regeneration of the audio output signal. 

The audio-to-packet delay is calculated i„ the following manner. Firstly the 
analysis tool 112 calculates the raw audi.to-packet delay as a time difference lietween 
*e tune stamp of the packet contaimng the encoded audio signal and the time stamp of 
fte marker packet preceding that packet (e.g. 17ms). Nex^ the analysis tool 1 12 detects 
tte posmon of the beginmng of the encoded audio input signal from the end of the 
payload. which can be e.g. 48 (maximum 160). Each byte represents a time duration of 
0.125n,s (1/8000S), i.e. 1ms audio signal is encoded by 8 bytes. 48 bytes are converted 
.o a nme value of 6ms, i.e. 48.0.,25ms. ,„e packe. delay introduced by the multimedia 

™ '"^ "direction is calculated by subtracting this time value ' 

ftom the raw packet delay, i.e 17ms - 6ms - 1 Ims. 

In the receive direction, the analysis tool 1,2 flrst calculates the raw packet-to- 
audto delay by subtracting the time stamp of dre marker packet that follows the Ethernet 
packet contaitung the audio signal and the ,ime stamp of the Ethernet packet containing 
*e aud,o signal (e.g. 40 ms). Next, the analysis tool detects the position of the 
begrnmng of the encoded audio signal ftom the beginning of the RTP pavload, which 
can oe e.g. 56 (mMmutn 1 60). 56 sample i„,e„als <r^!,^s into a time value of 7ms ' ' 
,(o6.0.13=ms). The,packe,-to-audlo'delayi«roduecd by,hemul,ir„edia,ermlnar,04 " 
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(106) in the receive direction is calculated by subtracting this time value from the raw 
packet-to-audio delay, i.e. 40ms - 7ms = 33ms. 

In order to better illustrate the extent of the introduced delays, Figure 3B shows 
a scenario wherein the multimedia terminals 104 and 106 introduce no additional 
5 delays. Even if the multimedia temiinals do not introduce delays, end-to-end delay still 
exists as a result of buffering (20 ms) and delays introduced by the network. 

Figure 4 shows an altemate embodiment of the present invention. This alternate 
embodiment is somewhat simpler and takes measurements only in one direction at a 
time across the multimedia terminals 104 and 106. The role of the signal generator 102 

10 of Figure 1 is divided into a pulse generator 302 and a trigger circuit 304 in this 

altemate embodiment which simplifies the components of the inventive system. To get 
measurements in both transmit and receive directions across a multimedia terminal, the 
measurement has to be carried out twice; once feeding the audio input source signal 
from pulse generator 302 into audio input 1 14 of the multimedia terminal 104, and for 

1 5 the second measurement feeding the input signal into audio input of the multimedia 
terminal 106. The pulse generator 302 also sends the test audio signal to the trigger 
circuit 304 which in turn issues a trigger signal to packet generator and network monitor 
11 0 to issue a marker packet. In a simpler maimer as described with respect to Figure 1 , 
trigger circuit 304 also issues a marker packet on receipt of a regenerated audio signal 

20 from multimedia terminal 106. In other respects, this altemate embodiment operates in a 
manner obviously similar to that previously described with respect to Figure 1 . 
Drawbacks of this simpler variant are that the measurement takes longer time to 
complete, and that no correlation of delays in transmit and receive direction across the 
multimedia terminals 104 and 106 at the same time can be given. 

25 A second alternative embodiment is depicted in Figure 4A wherein a loopback 

circuit is provided at the remote multimedia terminal, whereby the regenerated audio 
test signal is looped back through a loopback circuit electronically, acoustically or using 
a loopback feature of the multimedia terminal. Preferably this embodiment can be used 
in the field, where a loopback circuit at points A02-AI2 may be installed, possibly even 

30 by a non-technician. 

The benefit of this embodiment is that .without^haVing to provide an extra 
■physical connection tO'theTemote multimedia terminal, delays across the local 
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multimedia terminal can be determined as well as the loopback delay, which can be 
used to estimate the end-to-end delay. The loopback delay consists of the end-to-end 
delay from the local multimedia terminal to the remote multimedia terminal and 
additionally the end-to-end delay from the remote multimedia temiinal to the local 
multimedia terminal where the measurement equipment is located. 

The embodiment of Figure 1 assumes that the multimedia terminals 104 and 106 
are located in close proximity to be able to connect both temiinals to the signal 
generators 102. It is also possible to apply the principles of the invention remotely with 
minor modifications to the embodiment of Figure 1. Figure 2 is similar to Figure 1 with 
the exception of additional components in the form of signal generator and detector 130 
and telephones 132 and 134 connected over a telephone network (TN). 

Multimedia terminals 1 04 and 1 06 are accessed remotely, using manual or 
automated dialing and the phone calls placed thereby are answered either manually or 
using auto-answer capability. The signal generator and detector 130, as well as signal 
generator 102, are provided telephone line access capability. The dotted line between 
reference points All and AOl represent a loopback capability. This loopback capability 
is necessary to measure the delay over the telephone lines from the signal generator 102 
to the remote signal generator and detector 130 at the multimedia terminals 104 and 106 
and back to signal generator 1 02. The delay from AI to All (or A2 to AI2) is one-half of 
20 the measured loopback delay, assuming a circuit switched connection. The loopback 
measurements can be carried out using the arrangement shown in Figure 5. 

The purpose of the signal generator and detector 1 30 is to detect the transmitted 
audio signal from signal generation 102 and generate a pulse signal to All (A12) analog 
input of the multimedia terminal 104. The packet generator and network monitor 110 is " 
25 connected at a desired access point to the network. The packet delay measurements also 
include the delay (Al-AIl/AOl-Al and A2-AI2/A02-A2) over the telephone lines. 

Although the invention has been described in terms of the preferred and several 
alternate embodiments described herein, those skilled in the art will appreciate other 
embodiments and modifications which can be made without departing from the spirit 
3 0 and scope of the teachings of the invention. All such modifications are intended to be 
■■ included within the- ^c'ope of the claim's appended hereto.- "■ • . . ■ 
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Claims: 

1 . A system for measuring delays across equipment in a packet network 
comprising: 

(a) a signal generator for generating test audio signals and trigger 
signals; 

(b) one or more network terminals coupled to said network and said 
signal generator for receiving said test audio signals, coding and 
decoding audio stream packets, transmitting and receiving said audio 
stream packets on said network and regenerating audio signals; 

(c) a packet generator coupled to said network and said signal 
generator for generating marker packets on said network in response to 
said trigger signals. 

(d) a network monitor coupled to said network for capturing said 
marker packets, said audio stream packets and recording said packets to 
create captured data; and 

(e) a packet analyzer coupled to said network monitor for analyzing 
said captured data to generate and display measurements of delay. 

2. The system of claim 1 wherein said network uses Ethernet protocols. 

3 . The system of claim 1 wherein said packet generator and said network monitor 
are combined into a single device. 

4. The system of claim 1 wherein said test audio signal is a pulse of approximately 
1 millisecond duration. 

5. A system for measuring delays across equipment in a packet network 
cornprising: 

(a) a pulse generator for generating test audio signals; 

(b) one or more multimedia terminals coupled to' said riet\vofk and 
sai.^ -pulse generator for receiving said test- audio signals, coding and' 
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decoding audio stream packets, transmitting and receiving said audio 
stream packets on said network and creating regenerated audio signals; 

(c) a trigger circuit coupled to said pulse generator and said 
multimedia terminals for receiving regenerated audio signals and test 
audio signals and generating trigger signals; 

(d) a packet generator coupled to said network and said trigger 
circuit for generating marker packets on said network in response to said 
trigger signals; 

(e) a network monitor coupled to said network for capturing said 
marker packets and said audio stream packets and recording time stamps 
to create captured data; and 

(f) a packet analyzer coupled to said network monitor for analyzing 
said captured data to generate and display measurements of delays. 

The system of claim 5, further comprising a loopback circuit for feeding back 
said regenerated audio signals from an output to an input of said one or more 
multimedia terminals. 

A method for measuring delays across equipment in a packet network 
comprising the steps of: 

(a) generating a first marker packet on said network 
contemporaneously with the generation of a test audio signal to a 
network terminal; 

(b) generating an audio stream packet by said network terminal on 
said network containing an encoded audio signal corresponding to said 
test audio signal; 

(c) capturing said first marker packet and said audio stream packet 
with associated time stamps corresponding to moment of capture; and 

(d) subtracting said associated time stamps corresponding to said 
first-marker.packet- and said audio stream packet to determine raw. delay 

- across said network terminal. . " ; 
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The method of claim 7 comprising the further steps of: 

(a) analyzing said audio packet to determine time offset of said 
encoded audio signal from end of said audio stream packet; and 

(b) subtracting said time offset from said raw delay to obtain net 
delay across said network terminal. 

A method for measuring delays across equipment in a network comprising the 
steps of: 

(a) generating a first marker packet on said network 
contemporaneously with the generation of a test audio signal to a first 
network terminal; 

(b) generating an audio stream packet by first network terminal on 
said network containing an encoded audio signal corresponding to said 
test audio signal; 

(c) regenerating said test audio signal on a second network terminal 
in response to receipt of said audio stream packet on said second network 
terminal; 

(d) generating a second marker packet on network 
contemporaneously with said regenerating of test audio signal on said 
second network terminal; 

(e) capturing said first marker packet and said second marker packet 
with associated time stamps corresponding to moment of capture; and 

(f) ~ subtracting said associated time stamps corresponding to said 

first marker packet and said second marker packet to determine end-to- 
end delay across said network. 

A method for measuring delays across equipment in a network comprising the 
steps of: 

(a) generating first marker packet on said network 

contemporaneously with the generation of a test audio signal to a first 
network terminal; 

(b.) generating ah audio stream packet by first nerA^ork terfnmal oh. 

said. netv.:prk- containing^ ail -enc^^ corresDondiR-^ to siid 
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test audio signal; 

(^) regenerating said test audio signal on a second network terminal 

in response to receipt of said audio stream packet on said second network 
terminal; 

(d) generating a second marker packet on network 

contemporaneously with said regenerating of test audio signal; 

(^) capturing said first marker packet, said second marker packet and 

said audio stream packets with associated time stamps corresponding to 
moment of capture; 

subtracting said associated time stamps corresponding to said 
first marker packet and said second marker packet to determine end-to- 
end delay across said network; and 

(g) subtracting said associated time stamps corresponding to said 

second marker packet and said audio stream packet to determine raw 
delay across said second network terminal. 

The method of claim 10 comprising the further steps of: 

(^) analyzing said audio stream packet to determine time offset of 

said encoded audio signal from start of said audio stream packet; and 
subtracting said time offset from said raw delay to obtain net 
delay across said second network terminal. 
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